DIMwid - Decoder Inspection for Moses (using Widgets)
نویسندگان
چکیده
The development of accurate machine translation systems requires detailed analyses of the recurring translation mistakes. However, the manual inspection of the decoder log files is a daunting task because of their sheer size and their uncomfortable format, in which the relevant data is widely spread. For all major platforms, DIMwid offers a graphical user interface that allows the quick inspection of the decoder stacks or chart cells for a given span in a uniform way. Currently, DIMwid can process the decoder log files of the phrase-based stack decoder and the syntax-based chart decoder inside the Moses framework.
منابع مشابه
Design of the Moses Decoder for Statistical Machine Translation
We present a description of the implementation of the open source decoder for statistical machine translation which has become popular with many researchers in SMT research. The goal of the project is to create an open, high quality phrase-based decoder which can reduce the time and barrier to entry for researchers wishing to do SMT research. We discuss the major design objective for the Moses ...
متن کاملEdinburgh's Submission to all Tracks of the WMT 2009 Shared Task with Reordering and Speed Improvements to Moses
Edinburgh University participated in the WMT 2009 shared task using the Moses phrase-based statistical machine translation decoder, building systems for all language pairs. The system configuration was identical for all language pairs (with a few additional components for the GermanEnglish language pairs). This paper describes the configuration of the systems, plus novel contributions to Moses ...
متن کاملPipeline-tolerant decoder
This work presents a modified version of the cube pruning decoding algorithm used for machine translation. Our pipeline-tolerant decoding algorithm allows multiple translation hypotheses to be scored at the same time. We demonstrate the benefits of scoring multiple hypotheses simultaneously using a pipeline for language model scoring. Our pipeline builds on top of previous work on memory prefet...
متن کاملExploiting Multiple Resources for Japanese to English Patent Translation
This paper describes the development of a Japanese to English translation system using multiple resources and NTCIR-10 Patent translation collection. The MT system is based on different training data, the Wiktionary as a bilingual dictionary and Moses decoder. Due to the lack of parallel data on the patent domain, additional training data of the general domain was extracted from Wikipedia. Expe...
متن کاملUsing Linear Interpolation and Weighted Reordering Hypotheses in the Moses System
This paper proposes to introduce a novel reordering model in the open-source Moses toolkit. The main idea is to provide weighted reordering hypotheses to the SMT decoder. These hypotheses are built using a first-step Ngram-based SMT translation from a source language into a third representation that is called reordered source language. Each hypothesis has its own weight provided by the Ngram-ba...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Prague Bull. Math. Linguistics
دوره 100 شماره
صفحات -
تاریخ انتشار 2013